Data processing device and data processing method for creating a file in a specified format or outputting a file in a determined format

ABSTRACT

A data processing device for incorporating one or more scanned data files generated by an image scanning device for storage into a file in a specified format (e.g. an Excel file) equipped with a plurality of display areas (e.g. spreadsheets), which is capable of receiving user&#39;s designation of one of the display areas to which each of the scanned data files is allocated to (S 101 ), and modifying configuration data (e.g. part files) of the file so that each of the scanned data files is allocated to one of the display areas according to the user&#39;s designation (S 112 ).

CROSS-REFERENCE TO RELATED APPLICATION

This application is based on Japanese Patent Applications No.2010-238861 filed on Oct. 25, 2010 and No. 2010-268194 filed on Dec. 1,2010 the contents of which are incorporated herein by reference.

BACKGROUND

1. Technical Field

The present invention relates to a data processing device and a dataprocessing method for retrieving image data such as scanned data forstorage in a specified format.

2. Description of Related Arts

The multiple functions of recent scanners have increased user's freedomof specifying the saving format for scanned data generated by thesescanners. For example, Japanese Patent Application Publication No.2006-146486 discloses an MVP capable of dividing a document image intosmall objects such as texts, illustrations, photographs, and tables bymeans of vector-scanning, and storing them in an user's specified fileformat for each object type.

In this context, various efforts have been made for effective use ofscanned data, most of recent scanners are provided with a function toretrieve scanned data in JPEG format for storage in a specified fileformat such as Microsoft Word (registered trademark) and Microsoft Excel(registered trademark). These scanners are capable of receiving user'sdesignation of a file format for storage, but their data storageoperation will only result in random relocation of the scanned datawithin the designated format file. For example, if Excel is designatedas a storage file format, the entire scanned data will be stored into asingle spreadsheet (See FIG. 18). Consequently, the user will inevitablygo through relocation of the scanned data within the file if he/sheintends to make its effective use, and this means that user's burden inretrieving scanned data for storage has not been reduced as much asdesired.

SUMMARY

In order to achieve at least one of the objects mentioned above, thedata processing device for incorporating one or more scanned data filesgenerated by an image scanning device for storage into a file in aspecified file format equipped with a plurality of display areas, whichreflect one aspect of the present invention, comprises: a reception unitfor receiving user's designation of one of said display areas to whicheach of said scanned data files is allocated; and a data modificationunit for modifying configuration data of said file so that each of saidscanned data files is allocated to one of said display areas accordingto said user's designation received by said reception unit.

The data processing device which reflects another aspect of theinvention comprises: an acquisition unit for acquiring image data; adetermination unit for determining whether or not image layout of saidacquired image data matches with image layout of a predeterminedtemplate by comparing said acquired image data with template data areaof said template in order to determine whether or not said acquiredimage data fits into said template data area; and a control unit forgenerating vector data from said image data, and controlling said dataprocessing device so that said generated vector data is output in imagelayout of said template with which said determination unit determinesthat image layout of said acquired image data matches.

The objects, features, and characteristics of the present inventionother than those set forth above will become apparent from thedescription given herein below with reference to preferred embodimentsillustrated in the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram showing the structure of an image scanningsystem according to the first embodiment of the present invention.

FIG. 2 is a block diagram showing the structure of a PC according to thefirst embodiment of the first embodiment.

FIG. 3 is a block diagram showing the structure of a scanner accordingto the first embodiment.

FIG. 4A is a flowchart showing steps of the PC's processing according tothe first embodiment.

FIG. 4B is a flowchart showing steps of the PC's processing according tothe first embodiment.

FIG. 5 is a flowchart showing steps of the PC's processing according tothe first embodiment.

FIG. 6 is a flowchart showing steps of the PC's processing according tothe first embodiment.

FIG. 7 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 8 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 9 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 10 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 11 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 12 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 13 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 14 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 15 is a schematic diagram showing the content of the configurationdata (part file) of an Excel file after the modification by the PCaccording to the first embodiment.

FIG. 16 is a schematic diagram showing the content of the configurationdata (part file) of the Excel file after the modification by the PCaccording to the first embodiment.

FIG. 17 is a schematic diagram showing the content of the configurationdata (part file) of the Excel file after the modification by the PCaccording to the first embodiment.

FIG. 18 is a schematic diagram showing an Excel file into which scanneddata has been incorporated by an ordinary image scanning system.

FIG. 19 is a schematic diagram showing an Excel file into which scanneddata has been incorporated by an image scanning system according to thefirst embodiment.

FIG. 20 is a schematic diagram showing an Excel file into which scanneddata has been incorporated by an image scanning system according to thefirst embodiment.

FIG. 21 is a network diagram showing the structure of an image formingdevice according to the second embodiment of the present embodiment.

FIG. 22 is a block diagram showing the structure of a PC according tothe second embodiment.

FIG. 23 is a block diagram showing the structure of an image formingapparatus according to the second embodiment.

FIG. 24 is a schematic diagram showing paper documents in the slidemode, the note mode, and the distribution mode.

FIG. 25 is a schematic diagram showing a file to be output when a paperdocument in the note mode is scanned by an image scanning part.

FIG. 26 is a schematic diagram showing a file to be output when a paperdocument in the distribution mode is scanned by an image scanning unit.

FIG. 27 is a flowchart showing steps of the file output processingaccording to the second embodiment.

FIG. 28 is a schematic diagram showing a paper document in the notemode.

FIG. 29 is a schematic diagram showing the templates in the slide mode,the note mode, and the distribution mode.

FIG. 30 is a schematic diagram showing a group of part files of the file“sample.pptx”.

FIG. 31 is an schematic diagram showing the part file“[Content_Types].xml”.

FIG. 32A is a schematic diagram showing the part file“ppt¥notesSlides¥notesSlide1.xml”.

FIG. 325 is a schematic diagram showing the part file“ppt¥notesSlides¥notesSlide1.xml”.

FIG. 33A is a schematic diagram showing the part file“ppt¥Slides¥Slide1.xml”.

FIG. 33B is a schematic diagram showing the part file“ppt¥Slides¥Slide1.xml”.

FIG. 33C is a schematic diagram showing the part file“ppt¥Slides¥Slide1.xml”.

FIG. 33D is a schematic diagram showing the part file“ppt¥Slides¥Slide1.xml”.

FIG. 34 is a schematic diagram showing the part file“ppt¥Slides¥_rels¥Slide1.xml.rels”.

FIG. 35 is a schematic diagram showing a file path to the media folder“ppt¥ppt¥media¥image1.gif”.

FIG. 36 is a schematic diagram showing a display image of the file“sample.pptx”.

DETAILED DESCRIPTION

The embodiments of this invention will be described below with referenceto the accompanying drawings.

The first embodiment of the present invention will be described below.FIG. 1 is a block diagram showing the structure of an image scanningsystem S according to the present embodiment. As can be seen from FIG.1, the image scanning system S contains a PC 1 which serves as a dataprocessing device, and a scanner 2 which serves as an image scanningdevice, and both of them are connected with each other via a network N.

The network N is a LAN complying with a standard such as Ethernet(registered trademark), Token Ring, FDDI, etc., or a WAN with aplurality of LANs connected one another via a dedicated line. The PC 1and the scanner 2 according to the present embodiment can also beconnected directly instead of being connected via the network N. Typeand number of equipments connected to the network N are not limited tothe example shown in the figure.

The structures of the aforementioned equipments are described below indetail. FIG. 2 is a block diagram showing the structure of the PC 1according to the present embodiment. As shown in FIG. 2, the PC 1according to the present embodiment is equipped with a control unit 11,a storage unit 12, a display unit 13, an input unit 14, and aninput/output interface 15, and they are connected via bus 16 forexchanging signals. Details of these components will be described below.

The control unit 11 is a CPU (Central Processing Unit) for controllingoperations of each unit in accordance with control programs andexecuting various arithmetic processing. The storage unit 12 contains aROM (Read Only Memory) for storing control programs of PC 1's basicfunctions and various parameters, a RAM (Random Access Memory) fortemporarily storing various programs and data files to serve as aworking area, a hard disk for storing an OS (i.e. Basic Software),various control program for the particular processing shown below,various parameters, etc.

The display unit 13 is a display device such as an LCD for displayingvarious information to user. The input unit 14 contains a keyboard, amouse, etc., to be used for receiving various operational instructionsfrom user. The input/output interface 15 is an interface forcommunication with other devices on the network N.

The PC 1 according to the present embodiment is equipped with variouskinds of business software programs including a document preparationprogram such as Microsoft Word (registered trademark) and a spreadsheetprogram such as Microsoft Excel (registered trademark). In particular,the PC 1 supports Word and Excel files in the Office OPEN XML (OOXML)format. The PC 1 is provided with a scanner driver, which is a softwareproduct which can offer an user interface (UI) screen for settingoperations of the scanner 2.

FIG. 3 is a block diagram showing the structure of the scanner 2according to the present embodiment. As shown in FIG. 3, the scanner 2according to the present embodiment is equipped with a control unit 21,a storage unit 22, an operation unit 23, an image scanning unit 24, andan input/output interface 25, and they are connected via bus 26 forexchanging signals. Details of these components will be described below.

The control unit 21 is a CPU (Central Processing Unit) for controllingoperations of each unit in accordance with control programs andexecuting various arithmetic processing. The storage unit 22 contains aROM (Read Only Memory) for storing control programs of the basicfunctions of the scanner 2 and various parameters, a RAM (Random AccessMemory) for temporarily storing programs and data files to serve as aworking area, a hard disk for storing a program for controlling theparticular processing of the scanner 2 and various parameters.

The operation unit 23 is an operation panel such as a LCD for displayingan UI screen for showing various information to user as well asreceiving various operational instructions the user. The input/outputinterface 25 is an interface for communication with other devices on thenetwork N.

The image scanner 24 has a function to irradiate either a document seton a certain scanning position on a platen, or a document transported tothe same scanning position by an ADF (Auto Document Feeder) with a lightsource such as a fluorescent lamp, and to scan the reflected light fromthe document with light receiving elements such as CCD and CMOS imagesensors to generate digital data of the document image. Such a series ofoperations is hereinafter referred to as “scanning operation”. Thedigital data created by the scanning operation is hereinafter referredhere as “scanned data” or “scanned data file”.

In the image scanning system S with the aforementioned structure, the PC1 can retrieve more than one scanned data file created by the scanner 2into am Excel file in the OOXML format. More specifically, the PC 1 canreceive user's designation of a spreadsheet (hereinafter also referredto as “sheet”) as a storage destination of each scanned data file bymeans of the UI screen offered by the scanner driver, and store eachscanned data file into user's designated sheet.

The Excel file in the COXML format contains a group of binary data filessuch as text data files in the XML (Extensible Markup Language) andimage data files. Such a group is generally called as a package, andindividual data files within a package are called as part files. The PC1 according to the present embodiment can create an Excel file withuser's desired structure by modifying the part files within the packageaccordingly. In particular, the PC 1 is capable of incorporating imagedata such as scanned data into an Excel file by modifying or adding thepart files shown in Table 1.

TABLE 1 Part file Description ¥[Content_Type].xml This file shows a listof the content types of the part files within the package.¥docProps¥core.xml This file shows document information of the Excelfile (e.g. data and time of recent update). ¥xl¥drawings This file showspositional ¥drawings1.xml information of the image data to beincorporated into an Excel file. This file exists for each image data.¥xl¥drawing¥_rels This file shows relationship between¥drawing1.xml.rels the source part (i.e. “drawing1.xml” ) and its targetresource. ¥xl¥media¥image1.jpeg This file contains the image data (inJPEG format) to be incorporated into an Excel file.

While a newly created Excel file is equipped with three spread sheets,the system can also incorporate scanned data into additional sheetsstarting with the fourth sheet by modifying or adding the part filesshown in Table 2.

TABLE 2 Part file Description ¥[Content_Type].xml This file shows a listof the content types of the part files within the package.¥docProps¥app.xml This file shows application information of the Excelfile as well as sheet information of the additional sheet.¥docProps¥core.xml This file shows document information of the Excelfile (e.g. time and date of recent update). ¥xl¥workbook.xml This fileshows information on book structure of the Excel file. ¥xl¥_rels Thisfile shows relationship between ¥workbook.xml.rels the source part (i.e.“workbook.xml”) and its target resource. ¥xl¥worksheets This file showsinformation on the ¥sheet.xml sheet structure of the Excel file, Thisfile exists for each image data. ¥xl¥worksheets¥_rels This file showsrelationship between ¥sheet.xml.rels the source part (i.e. “sheet.xml”)and its target resource.

As can be seen from the above, the PC 1 according to the presentembodiment is capable of incorporating each scanned data file forstorage into user-designated sheet by modifying the relevant part filesin the package of the Excel file. FIG. 20 illustrates an Excel file tobe created by the PC 1 when “Sheet1” is designated as destination of thescanned data 1 and 2, and “Sheet2” is designated as destination of thescanned data 3.

Next, an overview of the PC 1's operations according to the presentembodiment is shown below. FIG. 4A and FIG. 4B are flowcharts showingsteps of the processing executed by the PC 1 for incorporating thescanned data generated by the scanner 2 for storage in a specifiedformat (hereinafter also referred to as “file storage processing”). Thealgorithm shown in the flowcharts in FIG. 4A and FIG. 4B is stored as acontrol program in the ROM of the storage unit 12 to be read into theRAM when the processing is launched.

Firstly, the PC 1 executes incorporation setting processing (S101) forreceiving user's instructions regarding the incorporation of the scanneddata. Specific steps in the incorporation setting processing (S101) areshown below with reference to FIG. 5. Firstly, the PC 1 displays an UIscreen (not shown) by means of the scanner driver to receiveinstructions regarding a user-specified file format for incorporatingthe scanned data (S201).

In the present embodiment, the file format for incorporating the scanneddata can be any file format as long as it is equipped with a pluralityof display areas (i.e. pages, sheets, etc.) like Word and Excel. Thefollowing explanation assumes that an Excel file in the OOXML format isspecified by user.

After receiving user's instructions regarding the number of scanned datafiles to be generated by the scanner 2, the sheet number of theincorporation destination for each scanned data file, etc. on the UTscreen (S202), the PC 1 returns to the flowchart in FIG. 4A (Return).The following explanation also assumes that the number of the scanneddata files to be generated by the scanner is “3”, and the sheet numbersof the incorporation destinations of these scanned data files (i.e.Scanned data 1, 2, and 3) are Sheet1, Sheet2, and Sheet3, respectively.

With reference to FIG. 4A again, the PC 1 transmits instructionsregarding the scanning operation based on user's incorporation settingsobtained in S101, to the scanner 2 by means of the scanner driver(S102). After that, the PC 1 creates an Excel file in the COXLM format(with file extension “xlsx”) anew, changes its extension to “zip” andthen decompresses the file with the changed extension (S103).

As can be seen from the above, prior to the generation of the scanneddata files by the scanner 2, the PC 1 receives user's instructionregarding the incorporation destination sheet. However, the PC 1according to the present embodiment can also display a preview of thescanned data files on the UI screen after generating them by the scanner2, in order to receive user's instruction regarding the incorporationdestination via the UI screen.

Next, the PC 1 determines whether or not additional sheets starting withthe fourth sheet (Sheet4, . . . ) is designated as the loadingdestinations of the scanned data files with reference to user'sincorporation settings obtained in S101 (S104). The reason for executingthis determination step is because the newly-created Excel file isequipped with 3 sheets alone, and the designation of additional sheetswould entail modification of the relevant part files within the packagefor creating the additional sheets.

The PC 1 then moves onto the steps from S105 to S110 to be describedlater if additional sheets are designated as the loading destination(S104: Yes) while moving directly onto S111 if additional sheets are notdesignated (S104: No). Details of the steps S105 through S110 are shownbelow.

Firstly, the PC 1 adds to the part file “[Content_Type]. xml” in thepackage of the newly created Excel file in S103, the extension of thescanned data files to be incorporated into the Excel file. Morespecifically, supposing that the file format of the scanned data filescreated by the scanner 2 is “JPEG”, such a data string as shown in FIG.7 is be added to the part file “[Content_Type].xml”.

The PC 1 then adds the information regarding the additional sheets toeach of the part files “app.xml” in the folder “¥docProps”, the partfile “workbook.xml” in the folder “¥xl”, and the part file“workbook.xml.rels” in the folder “¥xl¥_rels” (S106, S107 and, S108,respectively). Next, the PC 1 newly creates part files “sheet4.xml”, . .. for the additional sheets, and adds these part files to the folder“¥xl¥worksheets” (S109). Next, the PC 1 newly creates part files“sheet4.xml.rels”, . . . for the additional sheets, and adds these partfiles to the folder “¥xl¥worksheets¥_rels” (S110).

The newly created part files in S109 and S110 (i.e. “sheet4.xml”, . . .and “sheet4.xml.rls”, . . . ) will receive information on the additionalsheets in the scanned data incorporating processing (S112) to bedescribed later.

Next, the PC 1 newly creates a folder “¥xl¥media” for storing thescanned data files as well as the folders “¥xl¥drawings” and“¥xl¥drawings¥_rels” for storing various part files showing informationon the scanned data files, and adds these folders to the package (S111).

Next, the PC 1 repeats the scanned data incorporation processing (S112)to be described later for each scanned data file created by the scanner2. After finishing the scanned data incorporation processing in S112 forall the scanned data files, the PC 1 then rewrites the update time forthe part file “core.xml” in the folder “¥docProps” (S113). Morespecifically, in S113, the relevant portion in the part file“¥docProps¥core.xml” is replaced with the data string shown in FIG. 9.

The PC 1 then zips the package after the modification in the steps fromS103 to S113, and stores it after changing its file extension to “xlsx”(S114). The storage destination of the zipped file can be either thestorage unit 12 of the PC 1 or the storage unit 22 of the scanner 2, oreven an external storage device connected to the network N. After that,the PC 1 finishes the file storage processing (End).

Specific steps in the scanned data incorporation processing in S112 aredescribed below with reference to FIG. 6. Firstly, the PC 1 stores thescanned data files generated by the scanner 2 to the folder “¥xl¥media”created in S111 (S301). The PC 1 adds the content type of the scanneddata files stored in S301 to the part file “[Content_Type].xml” in thepackage (S302). More specifically, the step S302 involves modificationof the part file “[Content_Type].xml” in accordance with the followingsteps (a) and (b).

(a) Adding the extension of the scanned data files. More specifically,such a data string as shown in FIG. 7 is added if the file format of thescanned data files created by the scanner 2 is “JPEG”. Nevertheless, thestep (a) is skipped if the aforementioned S105 has already beenperformed.

(b) Adding the part name (i.e. “Part Name”) and the content type (i.e.“ContentType”) of the scanned data. For example, such a data string asshown in FIG. 8 is added as a result of repetition of the step (b) forthe scanned data 1 to 3. The part name (i.e. “PartName”) can be anycharacter string while the FIG. 8 illustrates a default which is made ofa combination of the character string “drawing” and the serial number.The content type (i.e. “ContentType”) is a predefined value for eachcontent type.

Next, the PC 1 specifies the sheet number of the scanned data'sincorporation destination (S303) by referring to the incorporationsettings obtained in S101. The sheet number specified in S303 ishereinafter referred to as “N” for the sake of convenience. The PC 1then determines whether or not data incorporation into SheetN is thefirst time ever (S304), and branches the subsequent steps in accordancewith the determination result.

If the data incorporation into SheetN is the first time (S304: Yes), thePC 1 creates the part file “drawingN.xml” which describes positionalinformation of the scanned data to be incorporated into SheetN, and addsthe part file to the folder “¥xl¥drawings” which has been created inS110. The positional information of the scanned data to be incorporatedinto SheetN is thus added to the part file “drawingN.xml” in the folder“¥xl¥drawings” (S305). More specifically, such a data string as shown inFIG. 13 is added to the part file in S305.

Next, the PC 1 newly creates a part file “drawingN.xml.rels” whichdescribes relationship of the part file “drawingN.xml”, and adds thepart file to the folder “¥xl¥drawings¥rels” which has been created inS110. The PC 1 then adds the relationship with the target resource (i.e.scanned data file) to the part file “drawingN.xml.rels” in the folder“¥xl¥drawings¥rels” (S306).

More specifically, such data strings as shown in FIGS. 10, 11, and 12are added to the relationship part file (i.e. “drawingN.xml.rels”) inS306. FIGS. 10, 11, and 12 correspond to N=1, 2, and 3 respectively. The“Relationship Id” shown in each drawing is the identification number ofeach relationship, and is an unique character string within eachrelationship part file (Same in FIGS. 14, 15, and 16). The “Type” hereinrefers to type of relationship, and is a predefined value for each typeof the target part files (Same in FIGS. 14, 15, and 16). The “Target”herein refers to data path to the target part file i.e. the scanned datafile in this example (Same in FIGS. 14, 15, and 16). It can be seen fromFIGS. 10, 11, and 12 that the target resources corresponding to thesource part files “drawing1”, “drawing2”, and “drawing3” are“image1.jpeg”, “image2.jpeg”, and “image3.jpeg”, respectively.

On the other hand, if the incorporation into SheetN is not the firsttime (S304: No), the PC 1 adds positional information of the scanneddata file to be incorporated into SheetN, to the part file“drawingN.xml” which has been added to the folder “¥xl¥drawings” (S307).The PC 1 then adds the relationship with the scanned data file to beincorporated into SheetN, to the part file “drawingN.xml.rels” which hasbeen added to the folder “¥xl¥drawing¥_rels” (S308).

The PC 1 then adds the relationship between SheetN and the scanned datafile to be incorporated into SheetN, to the part file“drawingN.xml.rels” in the folder “¥xl¥drawing¥_rels” (S309). Forexample, such data strings as shown in FIGS. 14, 15, and 16 are added tothe relationship part file (i.e. “sheetN.xml.rels”) in S309, FIGS. 14,15, and 16 correspond to N=1, 2, and 3, respectively.

Next, the PC 1 adds the ID information of the relationship which hasbeen added to the part file “sheetN.xml.rels” in S309, to the part file“sheetN.xml” in the folder “¥xl¥worksheets” (S310). FIG. 17 illustratescontent of the part file “sheetN.xml” which has received the IDinformation in S310. In this example, as the relationship ID to be addedto the part file “sheetN.xml” in S309 is “rId2” regardless of “N” (SeeFIGS. 14, 15, and 16), the letter string “<drawing r:id=“rId2”/>” isadded in S310 as the ID information of the relationship (See the italicsportion in FIG. 17).

After that, the PC 1 repeats the same processing for the remainingscanned data files, and returns to the flowchart of FIG. 4B when itfinishes all the scanned data files. As a result of modifying therelevant part files in the package in accordance with the flowchart inFIG. 4B (See FIG. 7-17), the scanned data 1, 2, and 3 created by thescanner 2 will be allocated to the user-designated spread sheets asshown in FIG. 19 (i.e. Sheet1, Sheet2, and Sheet3, respectively).

As can be seen from the above, the PC 1 according to the presentembodiment for incorporating one or more scanned data files generated bythe scanner 2 for storage into an Excel file in the OOXML format, iscapable of modifying the relevant part files of the Excel file so thatthe scanned data files will be allocated to user-designated displayareas (i.e. spreadsheets). Therefore, the present embodiment cansubstantially reduce the burden on user who tries to make efficient useof the data files incorporated into an Excel file.

The Image scanning device according to the present embodiment can alsobe an MFP (multifunction Peripheral) equipped with printing and copyingfunctions in addition to a scanning function while the presentembodiment uses the scanner 2 as an example. Furthermore, the dataprocessing device according to the present embodiment can also be abuilt-in device of an image forming device with a scanning function suchas a MFP. This means that the present embodiment also cover the aspectof the present invention where a single image forming device performsall the steps of creating scanned data, incorporating the scanned datainto a file in a specified format, and finally storing the file into aninternal storage device such as a HDD or an external storage device suchas a USB memory storage device, by itself.

The file format of the incorporation destination according to thepresent invention can be any file format equipped with a plurality ofdisplay areas such as spreadsheets and pages although the presentembodiment uses an Excel file and a Word file in the OOXML format asexamples.

The second embodiment of the present invention will be described below.FIG. 21 is a block diagram showing the structure of the image formingdevice 4 which serves as a data processing device according to thepresent embodiment.

The image forming device 4 is connected to a communication line 6 sothat it can communicate with the PC 5. The communication line 6 createsa network between the image forming device 4 and the PC 5. Thecommunication line 6 can conform to any communication method as long itensures connection between the PC 5 and the image forming device 4. Forexample, the communication line 6 can be a wired network using anEthernet (registered trademark) cable, a coaxial cable, optical fiberetc., a wireless network based on various standards, or any combinationof these wired and wireless communication methods. The communicationline 6 can also be LAN (Local Area Network), Internet, or any othernetwork in an arbitrary scale.

FIG. 22 is a block diagram showing the structure of the PC 5. The PC 5is equipped with a CPU (Central Processing Unit) 51, a RAM (RandomAccess Memory) 52, a ROM (Read Only Memory) 53, a storage unit 54, aninput interface 55, an output interface 56 and a communication device57, all of which are connected by a bus 50.

The CPU 51 cooperates with the programs stored in the ROM 53, andcontrols the operations of the PC 5 in accordance with the programs anddata read into the RAM 52. The RAM 52 stores various data created as aresult of the processing of the CPU 51 as well as temporary datagenerated in the course of the same processing. The ROM 53 stores theprograms and data retrieved by the CPU 51.

The storage unit 54 stores the programs and data retrieved by the CPU51. The storage unit 54 is a rewritable storage unit formed by acombination of a flash memory, a hard disk drive, and any otherrewritable storage device.

The input interface 55 is an interface for receiving an input from anexternal input device 58. The external input device 58 is typically akeyboard and a mouse, and is used to receive user's manual input.

The output interface 56 is an interface for sending an output to anexternal output device 59. The external output device 59 is typically adisplay device such as a CRT or a LCD for displaying an output screenbased on the processing result of the CPU 51.

The communication device 57 makes a connection between the PC 5 and anexternal communication network (e.g. communication line 6) to enable thePC 5 to communicate with an external equipment. The communication device57 is typically a NIC (Network Interface Card), and is capable of makinga connection in accordance with various types of communication methods.

FIG. 23 is a black diagram showing the structure of the image farmingapparatus 4. The image forming devices 4 shown in FIG. 21 have the samestructure, and therefore the following explanation is common to all ofthem. The image forming device 4 is equipped with a determination unit,a CPU 41 which functions as a control unit, a RAM 42, a ROM 43, astorage unit 44, an input interface 45, an image scanning unit 46 whichfunctions as an acquisition unit, an image printing unit 47, and acommunication device 48, all of which are connected via the bus 40.

The CPU 41 cooperates with the programs stored in the ROM 43, andcontrols the operations of the image forming device 4 in accordance withthe programs and data read into the RAM 42. The RAM 42 stores the datacreated as a result of the processing of the CPU 41 as well as temporarydata generated in the course of the same processing. The ROM 43 storesthe programs and data retrieved by the CPU 41.

The storage unit 44 stores the programs and data retrieved by the CPU41. The storage unit 44 is a rewritable storage unit formed by acombination of a flash memory, a hard disk drive, and any otherrewritable storage device.

The input interface 45 is an interface for receiving an input from aninput device such as an external input device 49. The external inputdevice 49 is typically an operation panel with a touch screen whichallows user to enter various instructions.

The image scanning unit 46 is equipped with an ADF unit, a platen glass,and an optical system such as CCD image sensors, realizing a function toscan a document image placed on the ADF or the platen glass by theoptical system. The image data obtained by scanning the image documentwith the image scanning unit 46 (i.e. analogy image signals) is put intoA/D conversion and various image processing, before being stored intothe storage unit 44 in the form of digital image data (or an image datafile) and being output to the image printing unit 47.

The image printing unit 47 executes image forming (i.e. printprocessing) based on the input image data. The printing method used bythe image printing unit 47 can be the electronic photography method, theink-jet method, the thermal transfer method, the offset method, etc. Inthe present embodiment, the image printing unit 47 performs imageforming by means of the electronic photography method.

The communication device 48 makes a connection between the image formingdevice 1 and an external communication network (e.g. communication line6) to enable the image forming device 4 to communicate with externalequipment. The communication device 48 is typically a NIC (NetworkInterface Card), and is capable of making connection in accordance withvarious types of communication methods.

Next, the image layouts of a paper document to be scanned by the imagescanning unit 46 will be described below with reference to FIG. 24. Thefollowing is an explanation of the image layout in the case whereMicrosoft PowerPoint (registered trademark) is used as an applicationsoftware (hereinafter referred to as “application”) to print out thepaper document. As shown in FIG. 24, the image layout of the paperdocument can be a slide mode (See FIG. 24 (A)), a note mode (See FIG. 24(b)), or a distribution mode (See FIG. 24 (C).

A paper document in the slide mode takes a form of a one-page slideimage. The slide image herein refers to an image formed on a PowerPointslide, and it also contains various images such as letters, lines,tables, figures, and photographs. A paper document in the note modetakes a form of a one-page slide image and a note image. The note imageconsists of a text image such as a memorandum concerning the slideimage. A paper document in the distribution mode takes a form of slideimages for a plurality of pages.

Next, the process flow for scanning a paper document to output thescanned paper document in an electronic file which can be edited byPowerPoint is shown below. When a paper document in the note mode isscanned by the image scanning unit 46 as shown in FIG. 25, for example,image data (or an image data file) consisting of a one-page slide imageand a note image is acquired. Vector data is then generated based on theacquired image data. The generate vector data is then output as a filein the note mode. In other words, the output file in the note mode isstored in the storage unit 44. When user enters an instruction to edit afile in the note mode, the file in the note mode is read out from thestorage unit 44 to be displayed on the touch screen of the externalinput device 49 (See FIG. 25). The file in the note mode can also betransmitted to the PC 5 via the communication device 48 to be displayedon the external output device 59. This will allow user to edit a file inthe note mode, thereby enhancing convenience for user.

When a paper document in the distribution mode is scanned by the imagescanning unit 46 as shown in FIG. 26, image data (or an image data file)consisting of slide images for a plurality of pages is acquired. Vectordata is then generated based on the acquired image data. The generatedvector data is then stored as a file in the distribution mode into thestorage unit 44. When user enters an instruction to edit a file in thedistribution mode, the file in the distribution mode is read out fromthe storage unit 44 to be displayed on the touch screen of the externalinput device 49 or on the external output device 59 (See FIG. 26). Thiswill allow user to edit a file in the distribution mode, therebyenhancing convenience for user.

Moreover, when a paper document in the slide mode is scanned by theimage scanning unit 46, image data (or an image data file) consisting ofa one-page slide image is acquired (not shown), and a file in the slidemode is output in accordance with the procedure illustrated in FIG. 25and FIG. 26. This will allow user to edit a file in the slide mode,thereby enhancing convenience for user.

Next, the file output processing will be described below with referenceto FIG. 27. The file output processing includes acquiring image data byscanning a paper document with the image scanning unit 46, determiningwhether or not the image layout of the acquired image data matches withthe image layout of a predefined template, generating vector data fromthe acquired image data, and outputting the generated vector data in thematching image layout with the template.

The following explanation assumes that a paper document in the note mode(See FIG. 28) is placed on the ADF or on the platen glass. As shown inFIG. 28, the slide image in the paper document in the note mode includesa text image, a picture image, and a graphic image. The note image inthe paper document also includes a text image (a text image within anote image is hereinafter referred to as “a text image (note)”).

The following explanation also assumes that a selection screen forallowing user to select an application to be used is displayed on thetouch screen of the external input device 49 beforehand, and userselects “PowerPoint” as the application to be used via the externalinput device 49.

The file output processing (See FIG. 27) by the image forming device 4can be triggered by user's depression of the scan button on the externalinput device 49 for initiating the scanning of a paper document, forexample.

Firstly, the image data acquisition is performed (S401). In other words,the image forming device 4 scans the paper document by the imagescanning unit 46 to acquire image data. The acquired image data (i.e.analog image signals) is put into A/D conversion. Then, various imageprocessing is applied to the image data after the A/D conversion, andthe digital image data after the image processing is stored into thestorage unit 44.

After the execution of S401, the image data stored in the storage unit44 is retrieved, and a matching check of the retrieved image data iscarried out by using a predefined template (S402). The matching check isintended to determine whether or not the image layout of the retrievedimage data matches with the template. Details of the predefined templateare shown below with reference to FIG. 29. The template in the presentembodiment is a typical file used for identifying the image layout ofthe retrieved image data. If the matching check reveals that the imagelayout of the retrieved image data matches with the template, the imagelayout of the template is determined as the image layout of theretrieved image data. In the present embodiment, three differenttemplates (i.e. a template in the slide mode, a template in the notemode, and a template in the distribution mode) are stored in the storageunit 44 to be used for the matching check.

The template in the slide mode (See FIG. 29 (A)) consists of image dataof a one-page slide image. When the image data matches with the templatein the slide mode, the slide mode is determined as the image layout ofthe image data. The template in the note mode (See FIG. 29 (B)) consistsof image data of a one-page slide image and image data of a note image.When the image data matches with the template in the note mode, the notemode is determined as the image layout of the image data. The templatein the distribution mode (See FIG. 29 (C)) consists of image data ofslide images for a plurality of pages. When the image data matches withthe template in the distribution mode, the distribution mode isdetermined as the image layout of the image data. Meanwhile, the imagedata area of the one-page slide image is defined as “main body area” ofthe template, and the image data area of the note image is defined as“note area” of the template. The main body area and the note area arealso defined generically as “template data area”.

The following is an explanation of the matching check in the case wherea paper document in the note mode (See FIG. 28) is scanned, and imagedata consisting of a one-page slide image and a note image is acquired.The matching check is intended to compare the acquired image data withthe template data area, to calculate a score showing matching degreebetween the image layout of the image data and the image layout of eachtemplate by checking whether or not the image data fits into thetemplate area and to determine the image layout based on the calculatedscore.

Firstly, the acquired image data and the template data area are comparedwith each other, and it is determined whether or not the image data fitsinto the template data area. The acquired image data herein consists ofa one-page slide image and a notebook image. In this case, it isdetermined whether or not the one-page slide image and the note image inthe acquired image data fit into the template area of each template.

In this example, the image data and the template area (i.e. the mainbody area and the note area) in the note mode (See FIG. 29 (B)) arecompared with each other, and it is determined that the image data fitsinto the main body area and the note area of the template as a wholewhen the one-page slide image and the note image fit into the main bodyarea and the note area of the template, respectively. In this case, “10points” are scored as the image data fits into the main body area, andadditional “10 points” are scored as the image data also fits into thenote area. In other words, “20 points” are scored in total for thetemplate in the note mode.

Similarly, the image data and the template area (i.e. the main, bodyarea) in the slide mode (See FIG. 29 (A)) are compared with each other,and “0 point” is scored for the template in the slide mode if it isdetermined that the one-page slide image and the note image data of theimage data does not fit into the main body area of the template.

Similarly, the image data and the template area (i.e. the main bodyarea) in the distribution mode (See FIG. 29 (C)) are compared with eachother, and “0 point” is scored for the template in the distribution modeif it is determined that the one-page slide image and the note image inthe image data does not fit into the main body area of the template.After calculating these scores, it is determined that image layout ofthe image data matches with the template with the highest score (i.e.the note mode).

The aforementioned matching check can also involve user's selection of atemplate in the case where the calculated scores are below a certainlevel. More specifically, selection information showing the image layoutof each template for user's selection can be displayed on the touchscreen of the external input device 49 for user's selection of thetemplate if the calculated scores are below a certain level. Theselection information can also be transmitted to the PC 2 via thecommunication device 48 to be displayed on the external output device59.

With reference to FIG. 27 again, discrimination of the matching checkresult is carried out after the execution of S402 (S403). If thediscrimination of the matching check result reveals that the imagelayout of the image data matches with the template in the slide mode(S403: Slide), the OCR processing is executed for the main body area ofthe image data (S404). If the discrimination of the matching checkresult reveals that the image layout of the image data matches with thetemplate in the note mode (S403; Note), the OCR processing is executedfor the main body area and the note area of the image data (S405). Ifthe discrimination of the matching check result reveals that the imagelayout of the image data matches with the template in the distributionmode (S403; Distribution), the OCR processing is executed for the mainbody area of the image data (S406). The main body area of the image dataherein refers to the data area of the image data which corresponds tothe main body area of the template. The note area of the image dataherein refers to the data area of the image data which corresponds tothe note area of the template. The image data can be classified intoeither the main body area or the note area, based on the presence orabsence of a frame in the image data itself. As a result of the OCRprocessing in the steps from S404 to S406, vector data is generated fromthe image data.

After the execution of the steps from S404 to S406, the file outputprocessing is executed (S407). The file output processing is intended tooutput the vector data in the image layout of the template with whichthe image layout of the acquired image data matches. More specifically,the file output processing includes steps of appending the vector datato an OOXML file, which can be handled by Microsoft Word (registeredtrademark), Microsoft Excel (registered trademark), Microsoft PowerPoint(registered trademark), etc., generating a file in the image layout ofthe template with which the image layout of the acquired image datamatches, and storing the generated file to the storage unit 44.

OOXML is a newly adopted file storage format (or a file format) inMicrosoft Office (registered trademark) 2007. This means that OOXML isgenerally used as a file format for Word, Excel, Powerpoint, etc.Therefore, an OOXML file can be handled (i.e. edited) by applicationssuch as Word, Excel, and Powerpoint.

The OOXML extensions for Word, Excel, and Powerpoint are “dccx”, “xlsx”,and “pptx”, respectively. User can view the content of an OOXML file“sample.pptx” by changing its extension “pptx” into “zip” and thendecompressing the same file with the changed extension “zip” using adecompression software, for example. The sample OOXML file “sample.pptx”consists of a plurality of part files as shown in FIG. 30.

FIG. 31-35 show the relevant part files to the data addition for thefile generation in the image layout of the template with which the imagelayout of the image data matches as well as the data adding positionswithin the relevant part files. The following examples show the relevantpart files to the data addition for the file generation as well as thepositions of the data addition within the relevant part files, based onthe assumption that the note mode is the image layout of the templatewith which the image layout of the image data matches.

If the type of the image data shown in FIG. 28 is “image1.gif” forexample, its content-type data needs to be added to the part file“[Content_Types].xml” (See FIG. 31) among a plurality of parts files,which consists of file definition information. More specifically, thedata string indicating the content type (i.e. “gif”) is added to thepart file “[Content_Types].xml” (See the added portion 1 in FIG. 31).

Moreover, the vector data resulted from the OCR processing of the notearea needs to be added to the part file“ppt¥notesSlides¥notesSlide1.xml” (See FIG. 32A and FIG. 32B) among aplurality of parts files, which consists of various data regarding thenote area. More specifically, the vector data (i.e. the text portion inFIG. 32B) resulted from the OCR processing of the note area of the imagedata is added to the part file “ppt¥notesSlides¥notesSlide1.xml.” Thetext portion in FIG. 32B corresponds to the vector data generated fromthe text image (note) shown in FIG. 28.

Moreover, the vector data resulted from the OCR processing of the mainbody area of the image data needs to be added to the part file“ppt¥notesSlides¥notesSlide1.xml” (See FIGS. 33A, 33B, 33C, and 33D)among a plurality of part files, which consists of various dataregarding the main body area. More specifically, the vector dataresulted from the OCR processing of the main body area of the image data(i.e. the text portion shown in FIG. 33A, the picture portion shown inFIG. 33B, and the graphic portion shown in FIG. 33C) should is added tothe part file “ppt¥Slides¥Slide1.xml”. The text portion in FIG. 33Acorresponds to the vector data generated from the text image shown inFIG. 28, the picture portion in FIG. 33B corresponds to the vector datagenerated from the picture image shown in FIG. 28, and the graphicportion of FIG. 33C corresponds to the vector data generated from thegraphic image shown in FIG. 28.

The relationship regarding the structure of the main body area needs tobe added to the part file “ppt¥Slides¥_rels¥Slide1.xml.rels” (See FIG.34) among a plurality of part files, which consists of variousrelationship data. More specifically, the data showing the relationshipof the part file “image1.gif” (i.e. Relationship ID and file path in theadded portion 2 shown in FIG. 34) is added to the relevant part file.The added portion 2 indicates that the picture data with therelationship ID “rId3” shown in the picture portion in FIG. 33B isstored in the media folder as “image1.gif”. The media folder consists ofpicture image files. For example, the file path to the “image1.gif” willbe “ppt¥ppt¥media¥image1.gif” when it is stored in the media folder (SeeFIG. 35).

As can be seen from the above, the addition of the vector data of themain body area and the note area to the file “sample.pptx” (i.e. theaddition of the vector data to the relevant part files among a pluralityof part files within “sample.pptx”) results in the generation of thefile “sample.pptx” in the notebook mode.

The generated file “sample.pptx” in the note mode is stored into thestorage unit 44. When user enters an instruction to edit the file“sample.pptx” in the notebook mode, the file “sample.pptx” in the notemode is retrieved from the storage unit 44 to be displayed on the touchscreen of the external input device 49 or on the external output device59 (See FIG. 36). This allows user to view the file “sample.pptx” in thenote mode, and therefore to edit the file “sample.pptx” via the externalinput device 49 or the external input device 58 using PowerPoint.

As shown in the above, the present embodiment includes determiningwhether or not the image layout of the acquired image data matches withthe image layout of the template by comparing the acquired data and thetemplate area in the template in order to determine whether or not theacquired image data fits into the template area. Thus, the presentembodiment ensures high accuracy in determining whether or not the imagelayout of the image data matches with the image layout of the template.The present embodiment also includes classifying the image data acquiredby the image scanning unit 46 into either the main body area or the notearea based on the presence or absence of a frame. Therefore, the presentembodiment ensures high accuracy in classifying the image data even whenthe text portion contains a plurality of areas with different attributes(e.g. main body area and the note area). The present embodiment alsoincludes generating the vector data from the image data, and outputtingthe generated vector data in the image layout of the template with whichthe image layout of the image data matches, thereby allowing user toedit the file using PowerPoint. The present embodiment hence improvesconvenience for user.

The present embodiment also includes calculating a score showing thematching degree of the image layout between the image data and thetemplate, and determining that the image layout of the image datamatches with the image layout of the template with the highest score,thereby ensuring high accuracy in the matching check of the image layoutbetween the image data and the template.

The present embodiment also includes displaying the selectioninformation for allowing user to select the image layout of the templateon the touch screen of the external input device 49 or the externaloutput display 59 if the calculate scores are below a certain level.Thus, the present embodiment allows user to select the image layout ofhis/her desired template if the calculated scores are below a certainlevel.

The present embodiment also includes adding the vector data to the file“sample.pptx” in the OOXML format to generate the file “sample.pptx” inthe note mode. The present embodiment also includes displaying the file“sample.pptx” in the note mode on the touch screen of the external inputdevice 49 or the external output device 59, thereby allowing user toview the file “sample.pptx” in the note mode and to edit the“sample.pptx” via the external input device 49 or the external inputdevice 58 using PowerPoint.

The image forming device 4 according to the present embodiment can alsoreceive the image data created on the PC 5 via the communication device48, instead of acquiring it by scanning a paper document with the imagescanning unit 46 as illustrate above. The application software accordingto the present embodiment is not limited to Powerpoint in spite of theexplanations set forth above. This means that various other applications(e.g. Word) can also be used for PowerPoint. Furthermore, a plurality oftemplates in different modes (e.g. a N-up mode template) can also beused.

Moreover, the data processing device according to the present inventioncan also be implemented by a dedicated hardware circuit for executingthe aforementioned steps, or a program executed by the CPU to performthe aforementioned steps. If the present invention is implemented by thelatter means, the control program of the data processing device can takea form of a computer readable recording medium such as a floppy(registered trademark) disk or CD-ROM, or a downloadable program filesupplied on-line via a network such as Internet. In the former case, theprogram recorded on the computer readable recording medium is normallytransmitted to a memory unit such as a ROM or a hard disk. The controlprogram can also take a form of an application software program or abuilt-in function of the data processing device.

What is claimed is:
 1. A data processing device for creating a file in aspecified format in which a plurality of scanned data files generated byan image scanning device by reading out a document are allocated toeither a plurality of pages or a plurality of sheets, the dataprocessing device comprising: a reception unit for receiving user'sdesignation of one of said plurality of pages or plurality of sheets towhich each of said plurality of scanned data files is allocated beforesaid plurality of scanned data files are generated by said imagescanning device; and a data creating unit for creating a file in thespecified format in which each of said plurality of scanned data filescreated by said image scanning device are allocated to said one of pagesor sheets which is designated corresponding to the allocated scanneddata file; wherein said pages or sheets are spreadsheets; and whereinwhen sheets starting with a fourth sheet in said specified format isdesignated as a spreadsheet in which the relevant scanned data file isallocated, said sheets starting with the fourth sheet are added.
 2. Thedata processing device as claimed in claim 1, wherein said receptionunit receives said user's designation of number of pieces of saidplurality of scanned data files to be generated and said user'sdesignation, before said scanned data files are generated by said imagescanning device, of said one of said plurality of pages or sheets towhich each of said plurality of scanned data files is allocated.
 3. Thedata processing device as claimed in claim 1 wherein said specifiedformat is Microsoft Excel(registered trademark).
 4. The data processingdevice as claimed in claim 1, wherein said specified format is MicrosoftExcel(registered trademark), and said plurality of pages or plurality ofsheets are pages.
 5. A data processing method for creating a file in aspecified format in which a plurality of scanned data files generated byan image scanning device by reading out a document are allocated toeither a plurality of pages or a plurality of sheets, the methodcomprising: (1) receiving user's designation of one of said plurality ofpages or plurality of sheets to which each of said plurality of scanneddata files is allocated before said plurality of scanned data files aregenerated by said image scanning device; and (2) creating a file in thespecified format in which each of said plurality of scanned data filescreated by said image scanning device are allocated to said one of pagesor sheets which is designated corresponding to the allocated scanneddata file said pages or sheets are spreadsheets; and wherein when sheetsstarting with a fourth sheet in said specified format is designated as aspreadsheet in which the relevant scanned data file is allocated, saidsheets starting with the fourth sheet are added.
 6. The data processingmethod as claimed in claim 5, wherein user's designation of number ofpieces of said plurality of scanned data files to be generated and saiduser's designation is received, before said scanned data is generated bysaid scanning device, of said one of said plurality of pages or sheetsto which each of said plurality of scanned data files is allocated. 7.The data processing method as claimed in claim 5 wherein said specifiedformat is Microsoft Excel(registered trademark).
 8. The data processingmethod as claimed in claim 5, wherein said specified format is MicrosoftWord(registered trademark), and said plurality of pages or plurality ofsheets are pages.
 9. A data processing device comprising: an acquisitionunit for acquiring image data; an input unit in which selection ofapplication software is input by a user; a determination unit for, bycomparing said image data acquired by said acquisition unit with aplurality of templates corresponding to said application softwareselected by the user, automatically determining which of said pluralityof templates matches with said image data; and a control unit forgenerating vector data from said image data, and controlling said dataprocessing device so that said generated vector data is output as a filein a format corresponding to said template which corresponds to saidapplication software selected by the user and is automaticallydetermined by said determination unit as matching with said image data;wherein said plurality of templates includes at least one of first dataarea with its outer edge being surrounded by a frame, and second dataarea with its outer frame not being surrounded by a frame; and saiddetermination unit classify said image data into data area correspondingto said first data area or data area corresponding to said second dataarea based on presence or absence of said frame within said image data.10. The data processing device as claimed in claim 9, wherein saiddetermination unit calculates a score of matching degree between saidimage data and a format of said template based on check result as towhether or not said image data fits into template data area of saidtemplate, and determines whether or not the format of said image datamatches with the format of said template based on said calculated score.11. The data processing device as claimed in claim 10, wherein saidcontrol unit controls said data processing device so that selectioninformation for allowing user to select layout format of said templateis output to an output unit if said calculated score is below a certainlevel.
 12. The data processing device as claimed in claim 9, whereinsaid control unit appends said vector data to a file in a specifiedformat corresponding to a specified application software selected by theuser, and generates a file in a format corresponding to said templatewith which said determination unit determines that said templatematches.
 13. The data processing device as claimed in claim 9, whereinsaid application software selected by the user is one of Microsoft Word(registered trademark), Microsoft Excel (registered trademark), andMicrosoft PowerPoint (registered trademark).
 14. The data processingdevice as claimed in claim 10, wherein said specified format is OOXML.15. A data processing method comprising: acquiring image data with anacquisition unit; inputting selection of application software with aninput unit; determining with a determination unit, by comparing saidimage data with a plurality of templates corresponding to saidapplication software, which of said plurality of templates matches withsaid image data; and generating, with a control unit, vector data fromsaid acquired image data, and outputting said generated vector data as afile in a format corresponding to said template which corresponds tosaid application software and is determined as matching with said imagedata; wherein said plurality of templates includes at least one of firstdata area with its outer edge being surrounded by a frame, and seconddata area with its outer frame not being surrounded by a frame; and saiddetermination unit classify said image data into data area correspondingto said first data area or data area corresponding to said second dataarea based on presence or absence of said frame within said image data.16. The data processing device as claimed in claim 15, wherein saiddetermination unit calculates a score of matching degree between aformat of said image data and a format of said template based on checkresult as to whether or not said image data fits into template data areaof said template, and determines whether or not the format of said imagedata matches with the format of said template based on said calculatedscore.
 17. The data processing device as claimed in claim 16, whereinsaid control unit controls said data processing device so that selectioninformation for allowing user to select layout format of said templateis output to an output unit if said calculated score is below a certainlevel.
 18. The data processing device as claimed in claim 16, whereinsaid file to be output is OOXML.
 19. The data processing device asclaimed in claim 15, wherein said control unit appends said vector datato a file in a specified format corresponding to a specified applicationsoftware selected by the user, and generates a file in a formatcorresponding to said template with which said determination unitdetermines that said template matches.
 20. The data processing device asclaimed in claim 15, wherein said application software selected by theuser is one of Microsoft Word (registered trademark), Microsoft Excel(registered trademark), and Microsoft PowerPoint (registered trademark).21. A data processing device comprising: an acquisition unit foracquiring image data; a determination unit for determining whether ornot image layout of said acquired image data matches with a predefinedtemplate, by comparing said acquired image data with template data areaof said template in order to determine whether or not said acquiredimage data fits into said template data area; and a control unit forgenerating vector data from said image data, and controlling said dataprocessing device so that said generated vector data is output in imagelayout of said template with which said determination unit determinesthat image layout of said acquired image data matches; wherein saiddetermination unit calculates a score of matching degree between imagelayout of said image data and image layout of said template based oncheck result as to whether or not said image data fits into saidtemplate data area, and determines whether or not image layout of saidimage data matches with image layout of said template based on saidcalculated score; wherein said control unit controls said dataprocessing device so that selection information for allowing user toselect image layout of said template is output to an output unit if saidcalculated score is below a certain level.